Applicant: Alan R. Brooks, et al.
ESTROGEN-REGULATED UNCONVENTIONAL MY
PROTEIN: COMPOSITIONS AND METHODS OF



1/25 



1 MGSLFQEAEP QAGTEQNKPT LASRFQQTLG DLLARLGSRG HVYVIHCLNP 
51 TPGKIPGLLD VGHVAEQLRQ AGILEIIGTR STHFPVRVSF QVFLARFHAL 
101 GSGRQKAASD QERCGAILSE VLGAESPLYH LGVTQVLLQE QGWQQLEQLW 
151 AQRRSQALLT LHRGLRACIT RQRLRLLPRM QARVRGLQAR KRYLQRRSAL 
201 GQLNTILLVA RPLLRRRQKL RCAPGPHSGE PWGKVSNMDL GRLEIPAQLA 
251 TLLERAEGHQ ALLTGSITES LPPEVPARPS LTLPPDIDQF PFSSFVSTSF 
301 QKPFLPRPGQ PLDEPLTRLD GENPQQALEI NRVMLRLLGE GSLQSWQEQT 
351 MGTFLVQQAQ RRPGLRDELF SQLVAQLWRN PDEQQNQRGW ALMVILLSSF 
401 APTPALEKPL LKFVSDQAPS GMAALCQHKL LGALEQTPLA PMASRSHPPT 
451 QLEWKAGLRR GRMALDVFTF NEESYSAEVE SWTTGEQFAG WILQSRGLEA 
501 PPRGWSVSLH SGDAWRDLPG CDFVLDLIGQ TEDLGDPAGP HNYPITPLGL 
551 AESIPPAPGV QAPSLPPGLP PGPAPILASS RPPGEASKPE NLDGFVDHLF 
601 EPALAPGFSD LEQGWALSRR MKGGGSVGPT QQGYPMVYPG MVQAPSYQPA 
651 MIPAPMPVMP AMGAVPTMPA MMVPPQPQPL VPSLDSRQLA LQQQNFINQQ 
701 AMILAQQMTT QAMSLSLEQQ NQRHQHQAQT SGATSQPPPS TTAPKAKKPP
751 APQEKPESNL EPSGVGLRED TPEEAESKPQ RPKSFQQKRD YFQKMGQDPI 
801 RVKTVKPPAK VQIPQEEMEE TEEEEDETAE LSPPPPPPPV VKKPLKASRP 
851 KAVKEDEAEP AQEEVPTQGE DPPVHSSNSA PQHPKPSRVP PVQSSNSAPP 
901 RPQPSREIRN IIRMYQSRPG PVAVPVQPTR PIKTFQKKND PKDEALAKLG 
951 INGVHLPLST SPNQGKSSPP AVVPRPKARP RLEPSLSIQE KQGPLRDLFG
1001 PCSPNPPTAP APPPPPALPP PLSGEPKTPS VESHALTEPM EDKNISTKLL 
1051 VPSGSVCFSY ANAPWKLFLR KEVFYPRENF SHPYCLSLLC QQILRDTFTE 
1101 SCTRISQDER HKMKGLLGDL EVSLETLDIV EDSIKKRIVV AARDNWANYF
1151 SRIFPVSGES GSDVQLLGVS HRGLRLLKVT QSPSFHLDQL KTLCSYSYAE
1201 VLTVQCRGRS TLELSLKNEQ LILHTAWARA IKAMVDLFLS ELRKDSGYVI
1251 ALRSYITDDN SLLSFHRGDL IRLLPVTALE PGWQFGSAGG RSGLFPDDVV
1301 QPAAAPDLSF SLGKRNSWQR KSKLGPAQEV RKTEEVK*

1 CGCTGGGACT GTCACCTACC AGGTGCACAA GTTCATAAAC AGAAACAGGG
51 GCCACCTGGA CCCCGCTGTG CTGGAGATGC TCAGGCAGAG CCAGCTGCAG 
101 GTGACCTAGC CTTCCTTTCA GCTCATGGGC AGCCTGTTCC AAGAAGCAGA 
151 GCCCCAGGCT GGGACTGAGC AAAACAAACC CACATTGGCC TCTCGATTCC 
201 AGCAGACCCT GGGTGACTTG CTAGCTCGGC TAGGCAGCAG GGGCCATGTC 
251 TACGTCATCC ACTGTCTCAA TCCCACCCCT GGAAAGATCC CAGGCCTCTT 
301 GGACGTGGGG CATGTGGCAG AGCAGCTGCG TCAGGCTGGC ATCCTGGAGA 
351 TCATAGGCAC CCGGAGTACC CACTTCCCCG TGCGAGTGTC CTTCCAAGTC 
401 TTTCTGGCAA GGTTCCATGC CCTGGGGTCA GGGAGACAGA AAGCTGCCTC
451 TGACCAGGAG AGGTGTGGTG CCATCCTCAG TGAAGTGCTG GGGGCAGAGT 
501 CACCGCTGTA TCATCTTGGA GTCACCCAGG TCCTGCTGCA GGAACAGGGC 
551 TGGCAGCAGC TAGAACAGCT GTGGGCTCAG CGGCGCTCAC AGGCCCTGCT 
601 CACTCTGCAC CGTGGCCTCC GAGCCTGTAT CACCCGGCAG CGCCTCCGTC 
651 TCCTGCCCCG GATGCAGGCT CGTGTGCGTG GGCTCCAGGC CAGGAAGCGA 
701 TATCTCCAGC GGAGGTCAGC TCTGGGACAG CTGAACACCA TTCTCCTAGT 

751 GGCCCGGCCC CTGCTCCGGA GACGACAGAA GCTACGGTGT GCCCCTGGCC
801 CGCACAGCGG GGAGCCCTGG GGGAAAGTGT CAAATATGGA CCTGGGTCGC
851 TTAGAGATCC CCGCCCAGCT GGCTACTCTG CTGGAGAGGG CGGAAGGCCA
901 CCAGGCCTTG CTGACGGGGA GCATCACAGA GTCCCTGCCA CCTGAGGTCC
951 CCGCCCGGCC CAGCCTGACT CTCCCTCCAG ACATTGACCA GTTTCCCTTC
1001 TCCAGTTTTG TATCCACCAG CTTTCAGAAG CCATTTCTGC CTCGACCAGG
1051 GCAGCCACTG GACGAGCCCC TGACGCGGTT AGATGGCGAG AACCCTCAGC

FIG. 2A. 
1101 AGGCTCTGGA GATCAACAGG GTGATGCTGC GGCTCCTGGG GGAAGGATCT
1151 CTGCAGTCCT GGCAAGAGCA GACCATGGGC ACGTTCCTCG TGCAGCAGGC 
1201 CCAGCGACGG CCGGGACTCC GAGATGAGCT CTTCAGCCAG CTGGTGGCCC 
1251 AGCTGTGGCG CAACCCAGAT GAGCAACAGA ATCAGCGTGG CTGGGCCCTA 
1301 ATGGTGATCC TGCTCAGCTC CTTTGCTCCC ACACCTGCCC TGGAGAAGCC 
1351 ACTGCTCAAA TTTGTATCTG ACCAGGCTCC CAGTGGCATG GCAGCCCTGT
1401 GCCAGCACAA GCTGTTAGGT GCCCTGGAGC AGACACCGCT GGCTCCCATG 
1451 GCTTCGAGGT CCCACCCACC CACACAACTT GAGTGGAAGG CTGGTTTACG 
1501 TCGGGGCCGC ATGGCGCTGG ATGTGTTCAC ATTCAACGAG GAAAGCTACT 
1551 CCGCGGAAGT GGAATCCTGG ACCACGGGAG AGCAGTTTGC AGGGTGGATC
1601 CTACAGAGCA GAGGCCTGGA GGCGCCCCCT CGTGGCTGGT CTGTGTCACT
1651 GCATTCTGGG GATGCTTGGC GTGACTTGCC TGGCTGTGAC TTTGTGTTGG
1701 ACCTAATAGG CCAGACTGAG GACTTGGGAG ACCCAGCTGG TCCCCACAAC 
1751 TACCCCATCA CTCCTCTTGG TTTAGCTGAG AGCATCCCTC CAGCGCCTGG 
1801 TGTCCAGGCT CCTTCCCTGC CCCCAGGACT CCCTCCAGGT CCAGCCCCAA
1851 TACTGGCCAG CAGCCGCCCT CCGGGCGAGG CCAGTAAGCC TGAGAACCTG 
1901 GATGGTTTCG TGGACCACCT CTTTGAACCA GCGCTCGCTC CGGGTTTCAG 
1951 TGATCTGGAA CAAGGCTGGG CCCTGAGCAG ACGCATGAAG GGAGGGGGCT 
2001 CTGTTGGGCC CACGCAGCAG GGCTACCCCA TGGTGTACCC AGGTATGGTG 
2051 CAGGCACCTA GCTACCAGCC AGCTATGATA CCCGCACCGA TGCCCGTCAT
2101 GCCAGCCATG GGCGCAGTCC CAACCATGCC AGCCATGATG GTGCCACCCC 
2151 AGCCACAGCC TCTGGTGCCC AGTTTGGACT CAAGGCAGCT GGCACTACAG
2201 CAGCAAAACT TCATCAACCA GCAGGCGATG ATTCTGGCGC AGCAGATGAC 
2251 CACCCAGGCC ATGAGCCTGT CCCTGGAGCA GCAGAATCAG AGACACCAGC

FIG. 2B. 
2301 ACCAAGCTCA GACCTCTGGG GCCACCTCCC AGCCTCCACC CTCAACCACT
2351 GCTCCCAAGG CCAAGAAGCC TCCTGCCCCC CAAGAGAAGC CAGAGAGTAA
2401 CCTAGAGCCT TCGGGTGTTG GCTTGAGAGA GGACACCCCA GAGGAAGCTG
2451 AAAGCAAGCC TCAGCGCCCC AAGAGCTTCC AACAGAAACG GGACTATTTC
2501 CAGAAGATGG GGCAAGATCC GATCAGAGTG AAGACGGTGA AACCTCCAGC 
2551 CAAGGTTCAG ATCCCCCAAG AGGAGATGGA GGAGACGGAG GAGGAGGAGG 
2601 ATGAGACCGC CGAGTTGTCC CCTCCTCCTC CCCCTCCCCC GGTTGTGAAG 
2651 AAGCCGCTGA AGGCAAGCAG GCCCAAAGCC GTAAAGGAAG ATGAGGCAGA
2701 GCCCGCCCAG GAGGAAGTAC CGACCCAGGG CGAGGATCCC CCGGTGCACA 
2751 GCTCCAACTC CGCACCTCAG CACCCCAAAC CCAGCAGGGT ACCCCCAGTG 
2801 CAGAGCTCCA ACTCCGCACC TCCACGCCCG CAACCCAGCA GGGAAATCCG 
2851 AAACATCATC CGAATGTACC AGAGCCGTCC AGGGCCTGTG GCTGTGCCCG
2901 TACAACCCAC CAGGCCCATC AAAACTTTTC AGAAGAAAAA TGACCCTAAG
2951 GATGAGGCTT TGGCTAAGTT AGGGATAAAT GGCGTCCACT TGCCCCTATC
3001 GACATCGCCT AACCAAGGGA AGAGCTCTCC ACCGGCTGTA GTTCCTCGAC 
3051 CTAAGGCTCG ACCTCGTCTT GAGCCTTCCC TATCCATCCA GGAAAAGCAG 
3101 GGACCCCTTC GGGACTTGTT TGGCCCATGT AGTCCAAACC CACCTACAGC 
3151 TCCAGCACCC CCGCCTCCAC CAGCACTCCC ACCGCCTCTG TCTGGGGAGC 
3201 CCAAGACCCC TTCAGTGGAG TCTCATGCCT TGACAGAGCC CATGGAGGAC 
3251 AAGAACATCT CCACAAAGCT CCTTGTGCCC TCTGGAAGTG TGTGCTTCTC 
3301 CTATGCCAAT GCACCCTGGA AGTTGTTCTT ACGCAAGGAG GTGTTCTACC
3351 CCCGGGAGAA CTTCAGTCAT CCATACTGCC TCAGTCTCCT CTGCCAGCAG
3401 ATCCTGCGGG ACACCTTCAC AGAGTCCTGC ACCCGGATCT CACAGGATGA 

FIG. 2C. 
3451 GCGGCACAAA ATGAAAGGCC TTCTGGGAGA CTTGGAGGTG AGTCTGGAGA
3501 CCCTTGACAT TGTTGAAGAC AGCATCAAAA AACGCATCGT GGTCGCTGCT
3551 CGGGACAACT GGGCCAATTA CTTCTCCCGC ATCTTCCCAG TCTCGGGTGA
3601 GAGTGGCAGC GATGTACAGC TGCTGGGTGT GTCTCACCGG GGACTGCGGC
3651 TGCTGAAGGT GACCCAAAGC CCGAGCTTCC ACCTGGACCA GCTGAAGACA
3701 CTCTGTTCCT ACAGCTATGC TGAAGTCCTG ACCGTGCAGT GCAGGGGCAG
3751 ATCCACCCTG GAGCTGTCCT TGAAGAATGA GCAGCTGATA CTGCACACAG
3801 CCTGGGCGAG GGCCATCAAG GCCATGGTGG ATCTATTTCT GAGTGAACTC
3851 AGGAAGGACT CCGGCTATGT CATCGCCCTG CGCAGCTACA TCACCGATGA
3901 CAATAGCCTC CTCAGTTTCC ACCGTGGGGA CCTCATTAGG TTACTGCCAG 
3951 TGACCGCTCT GGAACCAGGC TGGCAGTTCG GTTCTGCCGG GGGCCGCTCC 

4001 GGACTCTTTC CCGATGACGT GGTGCAGCCA GCTGCTGCCC CCGACCTCTC
4051 CTTTTCCCTG GGAAAGAGAA ACAGCTGGCA ACGCAAGAGT AAGCTGGGGC
4101 CAGCTCAGGA GGTGAGGAAG ACAGAAGAGG TGAAGTGATA CAGGCCTAAC 
4151 TTGGAGACTG AGAAGGAAAG AGCAGGGTTG CTTCGGGTGT TGTCCACTTC 
4201 CTGTCCTGGT GGCCAGGGCT CAATGTGTTC CTGTCCTTTA CCATCTCCTG 
4251 ACTTTTTGCC ATTTGTGAGA CTGTAAGTCA CACCCTCTAA CTCTGGTACT
4301 TAGTTCAGTG TCTCCATAGA GGATGCTTAA TAAATAACCT TGGTTTTCCT
4351 GGTTTCTGGT GTCACTCCTC TTGGGTCTAA TGGGTATGGG GACCAGGGCC
4401 TGAGAGTGAG TATTGGGCCT CTGGGCTAGA TGGTGGGTAC TGGGGTGGTA
4451 CCAAATTTCC TGTGCTCCCA GCGCCCCACC CATCCCAGGA AACAAGAACC
4501 CAGTGAAGAC TCGGAGGCCA CCTCCTTTAC AACCTACAGC TCTTTGTCTG
4551 CCGACCCCCA CAACTACACC ATGCAGGAAT TTGCCCTGCG CTATTTCCGG
4601 AAGCCTCATA CCTGGCTGAC CCAGATGAGT AGAGACACCA AAGAGAAAGC

FIG. 2D. 
4651 TGCCATCAAC CTGATCCAGT ACACTAAGGA CCCCATCCAG GAATCCCTTA
4701 CCAGCTTCTG CAATGGGGAC ACAAACAGTA AAGCTGTGGC TGGCTTCAAG
4751 GCTCTGATGC AGTTTATGGG GGACCAGCCT AAGCCCCGGG GCAAGGACGA
4801 GCTGAGTCTG CTCTATGAGC TGCTGAAGCT GTGCCAAGAT GACCTTAGGG
4851 ACGAGATGTA CTGCCAGGTC ATCAAGCAAG TCACAGGACA CCCCCAGCCA
4901 AAGCACTGTG CTCTGGGCTG GAGCGTCCTC AGCCTCTTCA CAGGCTTCTT
4951 TGCACCATCG ACCACGCTGA TGCCCTATGT GACCAAGTTC CTGCAGGATT
5001 CCAGCCCCAG TGAAGAGTTG GCCAGGAGGA GCCAGGAGAA CCTCCAGCGC 
5051 ACAGTTAAAT ATGGGGGACG CCAGCAGCTG CCGTTACCTG GTGAAATGAA 
5101 TGCTTTTCTG AAAGGGCAAG CAGTTCGTTT GCTTCTAATT CACCTGCCTG 
5151 GGGGTGTGGA CTACAGGACG AATTCACAGA CATTCACAGT GGCAGGGGAA 
5201 GTGCTAGAGG AGCTGTGTGG ACAGATGGGC ATCACAGACT TGGAAGAAGT
5251 GCAGGAATTT GCCCTCTTTC TCATCAAAGG AGAAGGTGAG CTGGTTCGGC
5301 CGCTGTCACC CCATGAGTAC ATCAACAATG TGGTGACGGA CCAGGACATG
5351 AGCCTTCACA GCCGACGGCT TGGTTGGGAG ACTCCACTGC ATTTTGATCA 
5401 CTCCACCTAC ACGGAAACCC ACTATGGCCA GGTGCTTCGG GACTACCTGC 
5451 AAGGGAAGCT GATAGTCAGC ACCCAGGCAG AGGCTCTACT TGCCCAGCTT
5501 GCTGCCTTCC AACACTTCGA CAAAACCGGA ACTTCTAGTC CTCCATCAGA 
5551 GCAAGAGCTG CTGTCTTATA TTCCCAAGCC ACTGCAATGG CAGGTGAACA 
5601 CAGCCAACAT AAAGAGCTTG GTGACCCAGG AGCTGAGGCA GATGCAAGGG
5651 TACAGCAAGC AGAGAGCACA GATTGGCTTT ATAGAGAGCA CAGCGCAGCT
5701 GCCCCTCTTT GGCTACACTG TGTACGTAGT GCTGAGAGTG AGTAAGCTGG
5751 CCCTCCCTGG ACCAGGCCTC CTGGGGCTGA ACCGTCAGCA CCTGGTCCTC

FIG. 2E. 
5801 ATGGACCCCA GCTCTCAGGA ACTCTGCTGC TCTGTCATGC TAAAAGACCT
5851 GAAGCAGTTC CACCTGCTGA GCCCACTGCA GGAGGACGGG CCCCCTGGCC
5901 TAGAACTCAA CTATGGCTCT GTTGACAACC CCCAGACCAT CTGGTTGGAG
5951 TTGCCACAGG CCCAGGAGCT GCAGCACACC ATCATCTTCC TGCTGGGCAG
6001 CATGTCCACT CAGTGGCCAG GTCTCCTCTG AGGAGTGGAG ATAAGGCAGC 
6051 GGTCTCTCAC TGGGCAGTCT GCCTTAGTCC TGCTCTGAAT CCGCTGCACA 
6101 ACCCCCCACC CCACGTGGAG GCCAAAAGGC AAAGTTGTGT CACCTGGGAG 
6151 AATAGGCAGA CACATCCCCT CTGGGGTGGA CTGCAACAGG AGTTGGGGCA 
6201 TTTGCTGGCT AGCCCCAGGG AAAATGCCCA CCCAGCTCGA AAGCGGCACA 
6251 AGTAAAACAC CCAAGGAAAA AAAAAAAAAA AAAAAAAAAA AAA 

FIG. 2F. 






Application No.: 09/803,126
Applicant: Alan R. Brooks, et al.
ESTROGEN-REGULATED UNCONVENTIONAL MY
PROTEIN: COMPOSITIONS AND METHODS OF
Sheet 9 of 25






1 CGCTGGGACT GTCACCTACC AGGTGCACAA GTTCATAAAC AGAAACAGGG 
51 GCCACCTGGA CCCCGCTGTG CTGGAGATGC TCAGGCAGAG CCAGCTGCAG 
101 GTGACCTAGC CTTCCTTTCA GCTCATGGGC AGCCTGTTCC AAGAAGCAGA 
151 GCCCCAGGCT GGGACTGAGC AAAACAAACC CACATTGGCC TCTCGATTCC 
201 AGCAGACCCT GGGTGACTTG CTAGCTCGGC TAGGCAGCAG GGGCCATGTC 
251 TACGTCATCC ACTGTCTCAA TCCCACCCCT GGAAAGATCC CAGGCCTCTT 
301 GGACGTGGGG CATGTGGCAG AGCAGCTGCG TCAGGCTGGC ATCCTGGAGA
351 TCATAGGCAC CCGGAGTACC CACTTCCCCG TGCGAGTGTC CTTCCAAGTC 
401 TTTCTGGCAA GGTTCCATGC CCTGGGGTCA GGGAGACAGA AAGCTGCCTC 
451 TGACCAGGAG AGGTGTGGTG CCATCCTCAG TGAAGTGCTG GGGGCAGAGT
501 CACCGCTGTA TCATCTTGGA GTCACCCAGG TCCTGCTGCA GGAACAGGGC 
551 TGGCAGCAGC TAGAACAGCT GTGGGCTCAG CGGCGCTCAC AGGCCCTGCT 
601 CACTCTGCAC CGTGGCCTCC GAGCCTGTAT CACCCGGCAG CGCCTCCGTC 
651 TCCTGCCCCG GATGCAGGCT CGTGTGCGTG GGCTCCAGGC CAGGAAGCGA 
701 TATCTCCAGC GGAGGTCAGC TCTGGGACAG CTGAACACCA TTCTCCTAGT
751 GGCCCGGCCC CTGCTCCGGA GACGACAGAA GCTACGGTGT GCCCCTGGCC
801 CGCACAGCGG GGAGCCCTGG GGGAAAGTGT CAAATATGGA CCTGGGTCGC 

851 TTAGAGATCC CCGCCCAGCT GGCTACTCTG CTGGAGAGGG CGGAAGGCCA
901 CCAGGCCTTG CTGACGGGGA GCATCACAGA GTCCCTGCCA CCTGAGGTCC 
951 CCGCCCGGCC CAGCCTGACT CTCCCTCCAG ACATTGACCA GTTTCCCTTC 

1001 TCCAGTTTTG TATCCACCAG CTTTCAGAAG CCATTTCTGC CTCGACCAGG 
1051 GCAGCCACTG GACGAGCCCC TGACGCGGTT AGATGGCGAG AACCCTCAGC 

FIG. 3A. 
1101 AGGCTCTGGA GATCAACAGG GTGATGCTGC GGCTCCTGGG GGAAGGATCT
1151 CTGCAGTCCT GGCAAGAGCA GACCATGGGC ACGTTCCTCG TGCAGCAGGC 
1201 CCAGCGACGG CCGGGACTCC GAGATGAGCT CTTCAGCCAG CTGGTGGCCC 
1251 AGCTGTGGCG CAACCCAGAT GAGCAACAGA ATCAGCGTGG CTGGGCCCTA 
1301 ATGGTGATCC TGCTCAGCTC CTTTGCTCCC ACACCTGCCC TGGAGAAGCC 
1351 ACTGCTCAAA TTTGTATCTG ACCAGGCTCC CAGTGGCATG GCAGCCCTGT 
1401 GCCAGCACAA GCTGTTAGGT GCCCTGGAGC AGACACCGCT GGCTCCCATG
1451 GCTTCGAGGT CCCACCCACC CACACAACTT GAGTGGAAGG CTGGTTTACG 
1501 TCGGGGCCGC ATGGCGCTGG ATGTGTTCAC ATTCAACGAG GAAAGCTACT 
1551 CCGCGGAAGT GGAATCCTGG ACCACGGGAG AGCAGTTTGC AGGGTGGATC 
1601 CTACAGAGCA GAGGCCTGGA GGCGCCCCCT CGTGGCTGGT CTGTGTCACT 
1651 GCATTCTGGG GATGCTTGGC GTGACTTGCC TGGCTGTGAC TTTGTGTTGG 
1701 ACCTAATAGG CCAGACTGAG GACTTGGGAG ACCCAGCTGG TCCCCACAAC
1751 TACCCCATCA CTCCTCTTGG TTTAGCTGAG AGCATCCCTC CAGCCCCTGG
1801 TGTCCAGGCT CCTTCCCTGC CCCCAGGACT CCCTCCAGGT CCAGCCCCAA
1851 TACTGGCCAG CAGCCGCCCT CCGGGCGAGG CCAGTAAGCC TGAGAACCTG
1901 GATGGTTTCG TGGACCACCT CTTTGAACCA GCGCTCGCTC CGGGTTTCAG 
1951 TGATCTGGAA CAAGGCTGGG CCCTGAGCAG ACGCATGAAG GGAGGGGGCT 
2001 CTGTTGGGCC CACCCAGCAG GGCTACCCCA TGGTGTACCC AGGTATGGTG 
2051 CAGGCACCTA GCTACCAGCC AGCTATGATA CCCGCACCGA TGCCCGTCAT
2101 GCCAGCCATG GGCGCAGTCC CAACCATGCC AGCCATGATG GTGCCACCCC 
2151 AGCCACAGCC TCTGGTGCCC AGTTTGGACT CAAGGCAGCT GGCACTACAG 
2201 CAGCAAAACT TCATCAACCA GCAGGCGATG ATTCTGGCGC AGCAGATGAC
2251 CACCCAGGCC ATGAGCCTGT CCCTGGAGCA GCAGAATCAG AGACACCAGC

FIG. 3B. 
2301 ACCAAGCTCA GACCTCTGGG GCCACCTCCC AGCCTCCACC CTCAACCACT
2351 GCTCCCAAGG CCAAGAAGCC TCCTGCCCCC CAAGAGAAGC CAGAGAGTAA
2401 CCTAGAGCCT TCGGGTGTTG GCTTGAGAGA GGACACCCCA GAGGAAGCTG
2451 AAAGCAAGCC TCAGCGCCCC AAGAGCTTCC AACAGAAACG GGACTATTTC
2501 CAGAAGATGG GGCAAGATCC GATCAGAGTG AAGACGGTGA AACCTCCAGC
2551 CAAGGTTCAG ATCCCCCAAG AGGAGATGGA GGAGACGGAG GAGGAGGAGG
2601 ATGAGACCGC CGAGTTGTCC CCTCCTCCTC CCCCTCCCCC GGTTGTGAAG
2651 AAGCCGCTGA AGGCAAGCAG GCCCAAAGCC GTAAAGGAAG ATGAGGCAGA
2701 GCCCGCCCAG GAGGAAGTAC CGACCCAGGG CGAGGATCCC CCGGTGCACA
2751 GCTCCAACTC CGCACCTCAG CACCCCAAAC CCAGCAGGGT ACCCCCAGTG
2801 CAGAGCTCCA ACTCCGCACC TCCACGCCCG CAACCCAGCA GGGAAATCCG
2851 AAACATCATC CGAATGTACC AGAGCCGTCC AGGGCCTGTG GCTGTGCCCG
2901 TACAACCCAC CAGGCCCATC AAAACTTTTC AGAAGAAAAA TGACCCTAAG
2951 GATGAGGCTT TGGCTAAGTT AGGGATAAAT GGCGTCCACT TGCCCCTATC
3001 GACATCGCCT AACCAAGGGA AGAGCTCTCC ACCGGCTGTA GTTCCTCGAC 
3051 CTAAGGCTCG ACCTCGTCTT GAGCCTTCCC TATCCATCCA GGAAAAGCAG 
3101 GGACCCCTTC GGGACTTGTT TGGCCCATGT AGTCCAAACC CACCTACAGC
3151 TCCAGCACCC CCGCCTCCAC CAGCACTCCC ACCGCCTCTG TCTGGGGAGC
3201 CCAAGACCCC TTCAGTGGAG TCTCATGCCT TGACAGAGCC CATGGAGGAC
3251 AAGAACATCT CCACAAAGCT CCTTGTGCCC TCTGGAAGTG TGTGCTTCTC
3301 CTATGCCAAT GCACCCTGGA AGTTGTTCTT ACGCAAGGAG GTGTTCTACC
3351 CCCGGGAGAA CTTCAGTCAT CCATACTGCC TCAGTCTCCT CTGCCAGCAG 
3401 ATCCTGCGGG ACACCTTCAC AGAGTCCTGC ACCCGGATCT CACAGGATGA 



FIG. 3C. 
3451 GCGGCACAAA ATGAAAGGCC TTCTGGGAGA CTTGGAGGTG AGTCTGGAGA
3501 CCCTTGACAT TGTTGAAGAC AGCATCAAAA AACGCATCGT GGTCGCTGCT
3551 CGGGACAACT GGGCCAATTA CTTCTCCCGC ATCTTCCCAG TCTCGGGTGA
3601 GAGTGGCAGC GATGTACAGC TGCTGGGTGT GTCTCACCGG GGACTGCGGC
3651 TGCTGAAGGT GACCCAAAGC CCGAGCTTCC ACCTGGACCA GCTGAAGACA
3701 CTCTGTTCCT ACAGCTATGC TGAAGTCCTG ACCGTGCAGT GCAGGGGCAG 

3751 ATCCACCCTG GAGCTGTCCT TGAAGAATGA GCAGCTGATA CTGCACACAG
3801 CCTGGGCGAG GGCCATCAAG GCCATGGTGG ATCTATTTCT GAGTGAACTC
3851 AGGAAGGACT CCGGCTATGT CATCGCCCTG CGCAGCTACA TCACCGATGA
3901 CAATAGCCTC CTCAGTTTCC ACCGTGGGGA CCTCATTAGG TTACTGCCAG
3951 TGACCGCTCT GGAACCAGGC TGGCAGTTCG GTTCTGCCGG GGGCCGCTCC 
4001 GGACTCTTTC CCGATGACGT GGTGCAGCCA GCTGCTGCCC CCGACCTCTC 

4051 CTTTTCCCTG GGAAAGAGAA ACAGCTGGCA ACGCAAGAGT AAGCTGGGGC
4101 CAGCTCAGGA GGTGAGGAAG ACAGAAGAGG TGAAGTGATA CAGGCCTAAC 
4151 TTGGAGACTG AGAAGGAAAG AGCAGGGTTG CTTCGGGTGT TGTCCACTTC 
4201 CTGTCCTGGT GGCCAGGGCT CAATGTGTTC CTGTCCTTTA CCATCTCCTG 
4251 ACTTTTTGCC ATTTGTGAGA CTGTAAGTCA CACCCTCTAA CTCTGGTACT 

4301 TAGTTCAGTG TCTCCATAGA GGATGCTTAA TAAATAACCT TGGTTTTCCT
4351 GGAAAAAAAA AAAAAAAAAA AAAAA



FIG. 3D. 
CGGCAGCAGCAGGCTCGGGCCTCCGAGGCTGCGTCCCAGGCCTCACCCTCAGCCGTCACCTCCAAG

CCCAGGAAGCCCCCCACACCCCCGGAGAAGCCACAGCGTGACCTGGGATCAGAGGGTGGCTGCCTG 

AGGGAGACCTCCGAGGAGGCTGAAGACAGGCCCTATCAGCCCAAGAGCTTCCAGCAGAAACGGAAC 

TATTTCCAGAGGATGGGGCAGCCACAGATCACAGTGAGGACGATGAAGCCCCCGGCCAAGGTCCAC 

ATCCCCCAGGGGGAAGCGCAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGCAGGAGGAGCAA 

GAAGTGGAAACAAGAGCAGCGCCGTCCCCTCCTCCTCCCCCCATCGTGAAGAAGCCATTGAAGCAA 

GGTGGGGCCAAAGCTCCAAAAGAGGCTGAGGCTGAGCCAGCCAAGGAGACAGCGGCCAAGGGCCAT 

GGCCAAGGGCCAGCCCAAGGCAGGGGGACTGTGGTGCGCAGTCAGACTCCAAGCCCAAGCGGCCAC 

AACCCAGCAGGGAAATTGGCAACATCATCCGCATGTACCAGAGCCGCCCGGGCCCCGTGCCTGTGC 

CCGTGCAGCCATCCAGGCCTCCCAAAGCTTTCCTGAGGAAAATCGACCCCAAGGACGAGGCTCTGG 

CCAAGCTGGGTATCAACGGTGCCCACTCGTCCCCGCCGATGCTGTCCCCCAGCCCAGGAAAGGGCC 

CCCCGCCAGCTGTGGCTCCTCGACCCAAGGCCCCGCTACAGCTTGGGCCCTCTAGCTCCATCAAGG 

AAAAGCAGGGGCCCCTTCTGGACCTGTTTGGCCAGAAGCTGCCTATTGCCCACACACCCCCACCTC 

CACCAGCGCCACCACTGCCTCTGCCCGAGGACCCAGGGACCCTTTCAGCAGAGCGTCGTTGCTTGA 

CACAGCCCGTGGAGGACCAGGGGGTCTCCACCCAGCTACTCGCGCCCTCTGGCAGCGTGTGCTTCT 

CCTACACCGGCACGCCCTGGAAGTTGTTCCTACGCAAGGAGGTGTTCTACCCACGGGAGAACTTCA 

GCCATCCCTACTACCTGAGGCTCCTCTGTGAGCAGATCCTACGGGACACCTTCTCCGAGTCCTGTA 

TCCGGATTTCCCAGAATGAGCGGCGGAAAATGAAAGACCTGCTGGGAGGCTTGGAGGTGGACCTGG 

ATTCTCTCACCACCACCGAAGACAGCGTCAAGAAGCGCATCGTGGTGGCCGCTCGGGACAACTGGG 

CCAATTACTTCTCCCGCTTCTTTCCTGTCTCGGGCGAGAGTGGCAGCGACGTGCAGCTGTTAGCCG 

TGTCCCACCGTGGGCTGCGACTGCTCAAGGTGACCCAAGGCCCCGGCCTCCGCCCCGACCAGCTGA 

AGATTCTCTGCTCATACAGCTTTGCGGAGGTGCTGGGTGTGGAGTGCCGGGGCGGCTCCACCCTGG 

AGCTGTCACTGAAGAGCGAGCAGCTGGTGCTGCACACAGCCCGGGCAAGGGCCATCGAGGCGCTGG 

TTGAGCTATTCCTGAATGAGCTTAAGAAGGACTCCGGCTATGTCATCGCCCTGCGCAGCTACATCA 

CTGACAACTGCAGCCTCCTCAGCTTCCACCGTGGGGACCTCATCAAGCTGCTGCCGGTGGCCACCC 

TGGAGCCAGGCTGGCAGTTTGGCTCTGCCGGGGGCCGTTCCGGACTCTTTCCTGCCGACATAGTGC 

AGCCGGCTGCCGCTCCCGACTTTTCCTTCTCCAAGGAGCAGAGGAGTGGCTGGCACAAGGGTCAGC 

TGTCCAACGGGGAACCAGGGCTGGCTCGGTGGGACAGGGCCTCAGAGGTGAGGAAGATGGGAGAGG 

GACAAGCAGAGGCAAGGCCTGCCTGAGACTGAGGAAGGAAAGGGGTTTGACCACTCCCGAGGCTGC 

CATGCGGTGGGACCACCCTGCTGTCCGTCTCCTGTGGCTGCCCCTCTGCCCGCTCCTGATGGCTCG 

CCTTGTCTCTCCAGCAAGACTGTGCACTCCTTGCAGGCAGGGGCTGGGCTGGATGCTGCTCTTGTG 

FIG. 5A. 
TCCCACGTGGTACTTAGTTCAAGGCTGCCCCAGCAGATGCTTAATAAACAGCTCTTCACTTTCCTG

GCTTCTGGTCTTGCTCCTTTGGTGTCTGGCTGGGGAGGGATGGGGCTGGGGCAGGACCCCTGGGAC 

AGGGCACTGGACACTCAGGTGGCACCAGGTTTCTTGTGATCCCAGCGCCCTGCCCACCCTTGGAGC 

CAGGCACACAGTGACGACTCGGAGGCCACCAGCCTGTCCTCTGTGGCCTATGCCTTTCTGCCCGAC 

TCCCACAGCTACACCATGCAGGAATTCGCCCGGCGTTACTTCCGGAGGTCCCAGGCCTTGCTGGGC 

CAGACTGATGGAGGTGCCGCAGGAAAGGACACGGACAGCCTGGTGCAGTACACCAAGGCTCCCATC 

CAGGAGTCGCTCCTCAGCCTCAGTGATGATGTGAGCAAGCTGGCTGTAGCCAGCTTCCTGGCCCCT 

GATGCGGTTTATGGGTGACCAGTCCAAGCCCCGGGGCAAGGATGAGATGGATCTGCTCTATGAACT 

GCTGAAGCTGTGCCAGCAGGAGAAGCTGAGGGATGAGATTTACTGCCAGGTTATCAAGCAGGTCAC 

AGGACACCCCCGGCCGGAACACTGCACTCGAGGCTGGAGCTTCCTCAGCCTTCTCACAGGCTTCTT 

CCCCCCGTCGACCAGGCTGATGCCCTACCTGACCAAGTTTCTGCAGGATTCAGGCCCCAGCCAAGA 

GCTGGCCCGGAGCAGCCAGGAGCACCTCCAGCGCACAGTCAAATATGGGGGGCGCCGGCGGATGCC 

CCCACCGGGTGAAATGAAGGCTTTCCTGAAAGGACAAGCGATTCGCCTGCTTCTTATTCACCTGCC 

GGGGGGTGTGGATTATAGGACGAATATCCAGACTTTCACAGTAGCAGCAGAAGTGCAGGAGGAGCT 

GTGCCGGCAAATGGGTATCACGGAGCCTCAGGAAGTGCAGGAATTCGCCCTCTTCCTCATCAAAGA 

GAAGAGCCAGCTGGTGCGGCCCCTGCAGCCCGCCGAATACCTCAACAGCGTGGTAGTGGACCAGGA 

CGTGAGCCTGCACAGCCGGCGGCTCCACTGGGAGACCCCACTGCACTTCGATAACTCCACCTACAT 

CAGCACCCACTACAGCCAGGTGCTGTGGGACTACCTTCAGGGGAAGCTGCCAGTCAGCGCCAAGGC 

AGACGCGCAGCTCGCCAGGCTGGCCGCCCTGCAGCACCTCAGCAAGGCCAACAGGAATACCCCCTC 

AGGGCAGGACCTGCTAGCTTACGTGCCAAAGCAGCTGCAACGGCAGGTGAACACGGCCTCCATCAA 

GAACCTGATGGGTCAGGAGCTGAGACGGCTGGAAGGACACAGCCCCCAGGAAGCACAGATCAGCTT 

CATTGAGGCCATGAGCCAGCTGCCCCTCTTCGGCTACACCGTCTATGGGGTGCTGCGAGTGAGCAT 

GCAGGCCCTGTCCGGACCCACTCTCCTGGGGCTCAACCGCCAGCATCTCATCCTCATGGACCCCAG 

CTCCCAGAGCCTGTACTGCCGCATTGCCCTGAAGAGCCTGCAGCGGCTCCACCTGCTAAGCCCTCT 

GGAGGAGAAGGGGCCCCCTGGCCTGGAAGTCAACTATGGCTCAGCTGACAACCCCCAGACCATCTG 

GTTTGAGCTGCCACAGGCCCAGGAGCTGCTATACACCACTGTCTTCCTGATAGACAGCAGTGCCTC 

TTGCACTGAGTGGCCCAGCATCAACTGAGAGGAGTGCAGGCCGGGGAGAGAAGAGGATGAGGCCTC 

CCCCGGCCCAAGTCTCACCCACATGGTCTGCCTTGGATGCTATCAGATCACTGTTCTAGAACCTGC 

CTCAGCACAGCCCAGCCGGCCCACATGCAGGCCATGAGGCAGGGGCTGCTATCACGTCACCAGCAG 

GCAAAGAAAACAGCCAGACCCTCTCCAGGACGGCCTGGGGCCAAAGCGGGCTGCAGGAACTCGGCT 

GGGGCACCTGAGGTTGCCCAGTCTGAGGGAGATGCCCACCCGACCCCAGGCTCCGCCCAGGCCCCA 

FIG. 5B. 
CATTAGCACAAGCCCAGGCATGGGAGAAACAGCTGCTGAGGAAATAAAACTCCCTAAAAAAAAAAA
AAAAAAAAAAAAAAAA 

FIG. 5C. 
MYQSRPGPVPVPVQPSRPPKAFLRKIDPKDEALAKLGINGAHSSPPMLSPSPGKGPPPAVAPRPKA

PLQLGPSSSIKEKQGPLLDLFGQKLPIAHTPPPPPAPPLPLPEDPGTLSAERRCLTQPVEDQGVST 

QLLAPSGSVCFSYTGTPWKLFLRKEVFYPRENFSHPYYLRLLCEQILRDTFSESCIRISQNERRKM 

KDLLGGLEVDLDSLTTTEDSVKKRIVVAARDNWANYFSRFFPVSGESGSDVQLLAVSHRGLRLLKV

TQGPGLRPDQLKILCSYSFAEVLGVECRGGSTLELSLKSEQLVLHTARARAIEALVELFLNELKKD

SGYVIALRSYITDNCSLLSFHRGDLIKLLPVATLEPGWQFGSAGGRSGLFPADIVQPAAAPDFSFS 

KEQRSGWHKGQLSNGEPGLARWDRASERPAHPWSQAHSDDSEATSLSSVAYAFLPDSHSYTMQEFA 

RRYFRRSQALLGQTDGGAAGKDTDSLVQYTKAPIQESLLSLSDDVSKLAVASFLALMRFMGDQSKP 

RGKDEMDLLYELLKLCQQEKLRDEIYCQVIKQVTGHPRPEHCTRGWSFLSLLTGFFPPSTRLMPYL 

TKFLQDSGPSQELARSSQEHLQRTVKYGGRRRMPPPGEMKAFLKGQAIRLLLIHLPGGVDYRTNIQ 

TFTVAAEVQEELCRQMGITEPQEVQEFALFLIKEKSQLVRPLQPAEYLNSVVVDQDVSLHSGGSTG

RPHCTSITPPTSAPTTARCCGTTFRGSCQSAPRQTRSSPGWPPCSTSARPTGIPPQGRTC 



FIG. 6. 
CGGCAGCAGCAGGCTCGGGCCTCCGAGGCTGCGTCCCAGGCCTCACCCTCAGCCGTCACCTCCAAG

CCCAGGAAGCCCCCCACACCCCCGGAGAAGCCACAGCGTGACCTGGGATCAGAGGGTGGCTGCCTG 

AGGGAGACCTCCGAGGAGGCTGAAGACAGGCCCTATCAGCCCAAGAGCTTCCAGCAGAAACGGAAC 

TATTTCCAGAGGATGGGGCAGCCACAGATCACAGTGAGGACGATGAAGCCCCCGGCCAAGGTCCAC 

ATCCCCCAGGGGGAAGCGCAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGCAGGAGGAGCAA

GAAGTGGAAACAAGAGCAGCGCCGTCCCCTCCTCCTCCCCCCATCGTGAAGAAGCCATTGAAGCAA 

GGTGGGGCCAAAGCTCCAAAAGAGGCTGAGGCTGAGCCAGCCAAGGAGACAGCGGCCAAGGGCCAT 

GGCCAAGGGCCAGCCCAAGGCAGGGGGACTGTGGTGCGCAGTCAGACTCCAAGCCCAAGCGGCCAC 

AACCCAGCAGGGAAATTGGCAACATCATCCGCATGTACCAGAGCCGCCCGGGCCCCGTGCCTGTGC 

CCGTGCAGCCATCCAGGCCTCCCAAAGCTTTCCTGAGGAAAATCGACCCCAAGGACGAGGCTCTGG 

CCAAGCTGGGTATCAACGGTGCCCACTCGTCCCCGCCGATGCTGTCCCCCAGCCCAGGAAAGGGCC 

CCCCGCCAGCTGTGGCTCCTCGACCCAAGGCCCCGCTACAGCTTGGGCCCTCTAGCTCCATCAAGG 

AAAAGCAGGGGCCCCTTCTGGACCTGTTTGGCCAGAAGCTGCCTATTGCCCACACACCCCCACCTC 

CACCAGCGCCACCACTGCCTCTGCCCGAGGACCCAGGGACCCTTTCAGCAGAGCGTCGTTGCTTGA 

CACAGCCCGTGGAGGACCAGGGGGTCTCCACCCAGCTACTCGCGCCCTCTGGCAGCGTGTGCTTCT 

CCTACACCGGCACGCCCTGGAAGTTGTTCCTACGCAAGGAGGTGTTCTACCCACGGGAGAACTTCA 

GCCATCCCTACTACCTGAGGCTCCTCTGTGAGCAGATCCTACGGGACACCTTCTCCGAGTCCTGTA 

TCCGGATTTCCCAGAATGAGCGGCGGAAAATGAAAGACCTGCTGGGAGGCTTGGAGGTGGACCTGG 

ATTCTCTCACCACCACCGAAGACAGCGTCAAGAAGCGCATCGTGGTGGCCGCTCGGGACAACTGGG 

CCAATTACTTCTCCCGCTTCTTTCCTGTCTCGGGCGAGAGTGGCAGCGACGTGCAGCTGTTAGCCG 

TGTCCCACCGTGGGCTGCGACTGCTCAAGGTGACCCAAGGCCCCGGCCTCCGCCCCGACCAGCTGA 

AGATTCTCTGCTCATACAGCTTTGCGGAGGTGCTGGGTGTGGAGTGCCGGGGCGGCTCCACCCTGG 

AGCTGTCACTGAAGAGCGAGCAGCTGGTGCTGCACACAGCCCGGGCAAGGGCCATCGAGGCGCTGG 

TTGAGCTATTCCTGAATGAGCTTAAGAAGGACTCCGGCTATGTCATCGCCCTGCGCAGCTACATCA 

CTGACAACTGCAGCCTCCTCAGCTTCCACCGTGGGGACCTCATCAAGCTGCTGCCGGTGGCCACCC 

TGGAGCCAGGCTGGCAGTTTGGCTCTGCCGGGGGCCGTTCCGGACTCTTTCCTGCCGACATAGTGC 

AGCCGGCTGCCGCTCCCGACTTTTCCTTCTCCAAGGAGCAGAGGAGTGGCTGGCACAAGGGTCAGC 

TGTCCAACGGGGAACCAGGGCTGGCTCGGTGGGACAGGGCCTCAGAGCGCCCTGCCCACCCTTGGA 

GCCAGGCACACAGTGACGACTCGGAGGCCACCAGCCTGTCCTCTGTGGCCTATGCCTTTCTGCCCG 

ACTCCCACAGCTACACCATGCAGGAATTCGCCCGGCGTTACTTCCGGAGGTCCCAGGCCTTGCTGG 

GCCAGACTGATGGAGGTGCCGCAGGAAAGGACACGGACAGCCTGGTGCAGTACACCAAGGCTCCCA 

FIG. 7A. 
TCCAGGAGTCGCTCCTCAGCCTCAGTGATGATGTGAGCAAGCTGGCTGTAGCCAGCTTCCTGGCCC

TGATGCGGTTTATGGGTGACCAGTCCAAGCCCCGGGGCAAGGATGAGATGGATCTGCTCTATGAAC 

TGCTGAAGCTGTGCCAGCAGGAGAAGCTGAGGGATGAGATTTACTGCCAGGTTATCAAGCAGGTCA 

CAGGACACCCCCGGCCGGAACACTGCACTCGAGGCTGGAGCTTCCTCAGCCTTCTCACAGGCTTCT

TCCCCCCGTCGACCAGGCTGATGCCCTACCTGACCAAGTTTCTGCAGGATTCAGGCCCCAGCCAAG 

AGCTGGCCCGGAGCAGCCAGGAGCACCTCCAGCGCACAGTCAAATATGGGGGGCGCCGGCGGATGC 

CCCCACCGGGTGAAATGAAGGCTTTCCTGAAAGGACAAGCGATTCGCCTGCTTCTTATTCACCTGC 

CGGGGGGTGTGGATTATAGGACGAATATCCAGACTTTCACAGTAGCAGCAGAAGTGCAGGAGGAGC 

TGTGCCGGCAAATGGGTATCACGGAGCCTCAGGAAGTGCAGGAATTCGCCCTCTTCCTCATCAAAG 

AGAAGAGCCAGCTGGTGCGGCCCCTGCAGCCCGCCGAATACCTCAACAGCGTGGTAGTGGACCAGG 

ACGTGAGCCTGCACAGCGGCGGCTCCACTGGGAGACCCCACTGCACTTCGATAACTCCACCTACAT 

CAGCACCCACTACAGCCAGGTGCTGTGGGACTACCTTCAGGGGAAGCTGCCAGTCAGCGCCAAGGC 

AGACGCGCAGCTCGCCAGGCTGGCCGCCCTGCAGCACCTCAGCAAGGCCAACAGGAATACCCCCTC 

AGGGCAGGACCTGCTAGCTTACGTGCCAAAGCAGCTGCAACGGCAGGTGAACACGGCCTCCATCAA 

GAACCTGATGGGTCAGGAGCTGAGACGGCTGGAAGGACACAGCCCCCAGGAAGCACAGATCAGCTT 

CATTGAGGCCATGAGCCAGCTGCCCCTCTTCGGCTACACCGTCTATGGGGTGCTGCGAGTGAGCAT 

GCAGGCCCTGTCCGGACCCACTCTCCTGGGGCTCAACCGCCAGCATCTCATCCTCATGGACCCCAG 

CTCCCAGAGCCTGTACTGCCGCATTGCCCTGAAGAGCCTGCAGCGGCTCCACCTGCTAAGCCCTCT 

GGAGGAGAAGGGGCCCCCTGGCCTGGAAGTCAACTATGGCTCAGCTGACAACCCCCAGACCATCTG 

GTTTGAGCTGCCACAGGCCCAGGAGCTGCTATACACCACTGTCTTCCTGATAGACAGCAGTGCCTC 

TTGCACTGAGTGGCCCAGCATCAACTGAGAGGAGTGCAGGCCGGGGAGAGAAGAGGATGAGGCCTC 

CCCCGGCCCAAGTCTCACCCACATGGTCTGCCTTGGATGCTATCAGATCACTGTTCTAGAACCTGC 

CTCAGCACAGCCCAGCCGGCCCACATGCAGGCCATGAGGCAGGGGCTGCTATCACGTCACCAGCAG 

GCAAAGAAAACAGCCAGACCCTCTCCAGGACGGCCTGGGGCCAAAGCGGGCTGCAGGAACTCGGCT 

GGGGCACCTGAGGTTGCCCAGTCTGAGGGAGATGCCCACCCGACCCCAGGCTCCGCCCAGGCCCCA 

CATTAGCACAAGCCCAGGCATGGGAGAAACAGCTGCTGAGGAAATAAAACTCCCTAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAA 

FIG. 7B. 